Brief comments on computational issues with multiple imputation
نویسندگان
چکیده
MI is usually performed under the assumption that the mechanism causing the missing data is ‘Missing At Random’. We discuss the practical implications of this elsewhere (Carpenter and Kenward, 2008). Here we note that although this assumption may be plausible, it cannot be verified from the data at hand, and therefore the analysis of a partially observed data set under the MAR assumption can never have the same status as the analysis of the fully observed dataset would have had. Thus, it is helpful to present any analysis carried out on the partially observed data (e.g. using MI) alongside the analysis based on only those units/individuals with no missing data (so called ‘Complete Cases’ (CC)), to see how the conclusions differ. If there should be differences, it is then important, if possible, to provide an explanation for these, as this increases confidence in any conclusions drawn.
منابع مشابه
Accuracy evaluation of different statistical and geostatistical censored data imputation approaches (Case study: Sari Gunay gold deposit)
Most of the geochemical datasets include missing data with different portions and this may cause a significant problem in geostatistical modeling or multivariate analysis of the data. Therefore, it is common to impute the missing data in most of geochemical studies. In this study, three approaches called half detection (HD), multiple imputation (MI), and the cosimulation based on Markov model 2...
متن کاملSelection of Variables that Influence Drug Injection in Prison: Comparison of Methods with Multiple Imputed Data Sets
Background: Prisoners, compared to the general population, are at greater risk of infection. Drug injection is the main route of HIV transmission, in particular in Iran. What would be of interest is to determine variables that govern drug injection among prisoners. However, one of the issues that challenge model building is incomplete national data sets. In this paper, we addressed the process ...
متن کاملچند رویکرد برخورد با مقادیر گمشده متغیرهای کمی و بررسی اثر آنها بر نتایج حاصل از یک کارآزمایی بالینی
Background and Objectives: A major challenge that affects the longitudinal studies is the problem of missing data. Missing in the data may result in the loss of part of the information which reduces the accuracy of the estimator and obtain the results will be biased and inaccurate. Therefore, it is necessary to evaluate the missing data mechanism from a longitudinal research and to consider thi...
متن کاملAn Empirical Comparison of Performance of the Unified Approach to Linearization of Variance Estimation after Imputation with Some Other Methods
Imputation is one of the most common methods to reduce item non_response effects. Imputation results in a complete data set, and then it is possible to use naϊve estimators. After using most of common imputation methods, mean and total (imputation estimators) are still unbiased. However their variances (imputation variances) are underestimated by naϊve variance estimators. Sampling mechanism an...
متن کاملEmpirical Constraints on Computational Theories of Metaphor : Comments on lndurkhya RICHARD
The goal of developing a computational theory of metaphor is an admirable one (Indurkhya, 1986, 1987). Such a theory, however, is adequate only to the extent that it can be embedded into a more comprehensive account of language processing. Empirical investigations have provided some important constraints on how the comprehension and evaluation of metaphorical language may fit into this grander ...
متن کامل